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ABSTRACT 

In this work, we study the problem of power allocation in teams. 
Each team consists of two agents who try to split their available 
power between the tasks of communication and jamming the nodes 
of the other team. The agents have constraints on their total energy 
and instantaneous power usage. The cost function is the difference 
between the rates of erroneously transmitted bits of each team. We 
model the problem as a zero-sum differential game between the 
two teams and use Isaacs ' approach to obtain the necessary condi- 
tions for the optimal trajectories. This leads to a continuous-kernel 
power allocation game among the players. Based on the commu- 
nications model, we present sufficient conditions on the physical 
parameters of the agents for the existence of a pure strategy Nash 
equilibrium (PSNE). Finally, we present simulation results for the 
case when the agents are holonomic. 

1. INTRODUCTION 

The decentralized nature of wireless ad hoc networks makes them 
vulnerable to security threats. A prominent example of such threats 
is jamming: a malicious attack whose objective is to disrupt the 
communication of the victim network intentionally, causing inter- 
ference or collision at the receiver side. Jamming attack is a well- 
studied and active area of research in wireless networks. Unautho- 
rized intrusion of such kind has started a race between the engineers 
and the hackers; therefore, we have been witnessing a surge of new 
smart systems aiming to secure modem instrumentation and soft- 
ware from unwanted exogenous attacks. 

The problem under consideration in this paper is inspired by recent 
discoveries of jamming instances in biological species. In a se- 
ries of playback experiments, researchers have found that resident 
pairs of Peruvian warbling antbirds sing coordinated duets when 
responding to rival pairs. But under other circumstances, cooper- 
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ation breaks down, leading to more complex songs. Specifically, 
it has been reported that females respond to unpaired sexual rivals 
by jamming the signals of their own mates, who in turn adjust their 
signals to avoid the interference 1271 . 

Many defence strategies have been proposed by researchers against 
jamming in wireless networks. A brief survey of various techniques 
in jamming relevant to our research is provided in [5]. In the past, 
networks with multiple attackers have also been considered in the 
literature. In |T5 16|, the authors consider the interaction between 
a source-destination pair, an eavesdropper, and friendly jammers. 
The source can buy "jamming power" from the friendly jammers 
which disguise the eavesdropper. This allows the source to achieve 
increased secrecy rate. The authors study the problem in the con- 
text of a Stackelberg game and show that a trade-off exists between 
the price announced by the jammers and the resulting performance. 
A similar problem was tackled in |9| where relay nodes can help 
the source in the presence of multiple eavesdroppers. The authors 
propose different relaying schemes and study two design problems: 
minimizing the transmission power subject to a minimum secrecy 
rate and maximizing the secrecy rate subject to a total power con- 
straint. Analytical analyses show that relaying yields improved per- 
formance when compared to direct transmission in malicious envi- 
ronments. Different from the aforementioned references, our work 
here considers non-friendly jamming teams, i.e., the security bot- 
tleneck considered here is jamming and not eavesdropping. More- 
over, to the best of our knowledge, pursuit-evasion strategies for 
jamming teams were not studied before. 

Ad hoc networks consist of mobile energy-constrained nodes. Mo- 
bility affects all layers in a network protocol stack including the 
physical layer as channels become time- varying 1 13,| . Moreover, 
nodes such as sensors deployed in a field or military vehicles pa- 
trolling in remote sites are often equipped with non-rechargeable 
batteries. Power control plays, therefore, a crucial role in design- 
ing robust communications systems. At the physical layer, power 
control can be used to maximize rate or minimize the transmis- 
sion error probability, see |18, 3 | and the references therein. In 
addition, in multi-user networks, power control can be used to reg- 
ulate the interference level at the terminals of other users 1241 1281 
1101 . Due to the lack of a centralized infrastructure in ad hoc net- 
works, distributed solutions are essential. In this work, similar to 
O 24, 28 1, we model the power allocation problem as a noncoop- 
erative game, thus allowing us to devise a non-centralized solution. 
As a departure from previous research, however the power control 
mechanism we propose splits the power budget of each player into 
two portions: a portion used to communicate with team-mates and 
a portion used to jam the players of the other team. More impor- 



tantly, the objective function is chosen to be the difference between 
the cumulative bit error rate (BER) of each team; this allows for 
increased freedom in choosing physical design parameters, besides 
the power level, such as the size of modulation schemes. 



{1",2''} and Team Bis comprised of the two players {1^2''}. The 
agents move on a plane and therefore, have two degrees of freedom 
{x,y). The dynamics of the players are given by the following 
equations: 



In the past year, there have been reports of predator drones being 
hacked 1141 1201 . resulting in intruders gaining access to classified 
data being transmitted from an aircraft. Motivated from such in- 
cidents, we have analyzed various scenarios of evading jamming 
attacks among autonomous agents. In the case of a single jammer 
trying to intrude the communication link between a transmitter and 
a receiver, the problem can be formulated as multiplayer (specifi- 
cally, three player) pursuit-evasion game I17l l2l. In O, we inves- 
tigated the problem of finding motion strategies for two unmanned 
autonomous vehicles (UAVs) to evade jamming in the presence 
of an aerial intruder. We considered a differential game theoretic 
approach to compute optimal strategies by a team of UAVs. We 
formulated the problem as a zero-sum pursuit-evasion game. The 
cost function was picked as the termination time of the game. We 
used Isaacs ' approach to derive the necessary conditions to arrive at 
the equations governing the saddle-point strategies of the players. 
In (7), we extended the previous analysis to a team of heteroge- 
neous vehicles containing UAVs and autonomous ground vehicles 
(AGVs). In 1 4 1, we analyzed the problem of multiple jammers in- 
truding the communication network in a formation of UAVs. In 
1,6,1, we analyzed the problem of connectivity maintenance in multi- 
agent systems in the presence of a jammer. In this current work, 
we study a scenario where a team of malicious nodes launch a 
jamming attack on another team, which is capable of jamming as 
well. We again use differential game theory to study the pursuit- 
evasion strategies of the teams, which constitute of uimiaimed de- 
cision makers (UDMs). 

The main contributions of this paper are as follows. To the best of 
our knowledge, this is the first work that considers the problem of 
two teams of mobile autonomous agents jamming each other. Our 
analysis takes into consideration constraints in energy and power 
among the agents. Moreover, we relate the problem of optimal 
power allocation for communication and jamming to the commu- 
nication model between the agents. Finally, we provide a sufficient 
condition for existence of an optimal decision strategy among the 
agents based on the physical parameters of the problems. 

The rest of the paper is organized as follows. We formulate the 
problem in Section|2]and explain the underlying notation. In Sec- 
tion[3] we introduce and solve an associated optimal control prob- 
lem. The Nash equilibrium properties of the team power con- 
trol problem are studied in Section O and the specific example of 
systems employing uncoded M-quadrature amplitude modulations 
(QAM) follows in Section [5] Simulation results are presented in 
Section |6] We conclude the paper and provide future directions in 
Section]?] An Appendix at the end includes explicit expressions for 
some of the variables introduced in the paper. 

2. PROBLEM FORMULATION 

Consider two teams of mobile agents. Each agent is communicat- 
ing with members of the team it belongs to and at the same time, 
jamming the communication between members of the other team. 
We consider a scenario where each team has two members, though 
at a conceptual level our development applies to higher number 
of team members as well. Team A is comprised of the two players 



• Team A: 

• TeamB: 

In the above equations, xi and ui denote vectors representing the 
state and control input of agent i, with the superscript (a or b) 
identifying the corresponding team. The state space of the entire 
system is represented by X~ 9? x 9? x x R^. Moreover, 
Ui G ~ {0 : [0, t] — >■ Ai I <!>{■) is measurable}, where 
Ai C R''* . / : R^ X j4i xR ^ R is uniformly continuous, bounded 
and Lipschitz continuous in x, for fixed ui. Consequently, given 
a fixed Ui(-) and an initial point, there exists a unique trajectory 
solving (TJ and ([2} 1 1 1. 

Now, we describe the physical layer communications model in the 
presence of a jammer which is motivated by 1261 . For each trans- 
mitter and receiver pair, we assume the following communications 
model. Given that the transmitter and the receiver are separated by 
a distance d, and the transmitter transmits with constant power Pt, 
the received signal power Pr is given by 

Pr = pPrd-", (3) 

where p depends on the antennas' gains and, according to the free 
space path loss model, is given by: 

(47r)2 ' 

where A is the signal's wavelength and Gt, Gr. are the transmit and 
receive antennas' gains, respectively, in the line of sight direction. 
In real scenarios, p is very small in magnitude. For example, using 
nondirectional antennas and transmitting at 900 MHz, we have p = 
= 6.896 X 10"''. 

The signal-to-interference ratio (SINR) s is given by 



where a is the ambient noise level. The Bit Error Rate (BER) is 
given by the following expression: 

Pit) = g{s), (5) 

where g{-) is a decreasing function of s. Explicit expressions for 
g{-) are provided in Section V where we consider the example of 
M-QAM. Each player uses its power for the following purposes: 
(1) Communicating with the team-mate, and (2) Jamming the com- 
munication of the other team. We assume that fa and are the 
frequencies at which Team A and Team B communicate, respec- 
tively, and fa 7^ fb- 



For an initial position xo G X, the outcome of tiie game n, is given 
by thie following expression: 

^(xo,<,ug,u^u^) = r [pUt)+p^t)-p\{t)-pUt)]dt, 
Jo ■■ . ' 

L 

where Pi{t) and Pi{t) are the BERs for agent i in team A and team 
B, respectively, and T is the time of termination of the game, pi 
depends in Si, i.e, the SINR perceived by agent i. From (O, Si 
depends on the mutual distances between the players. Therefore, 
we can see that the outcome functional, tt, depends on the state of 
the players and hence, their control inputs. The outcome functional 
models the difference in the erroneous communication packets ex- 
changed between the members of the same team during the entire 
course of the game. The objective of team A is to minimize tt and 
the objective of team B is to maximize it. 

Let Pi{t) and Pi{t) denote the instantaneous power levels for 
communication used by player i in Team A and Team B, respec- 
tively. Since the agents are mobile, there are limitations on the 
amount of energy available to each agent that is dictated by the ca- 
pacity of the power source carried by each agent. We model this 
restriction as the following integral constraint for each agent 

/ Pi{t)dt < E. (6) 
Jo 

The game is said to terminate when any one agent runs out of 
power, that is (6) is violated. 

In addition to the energy constraints, there are limitations on the 
maximum power level of the devices that are used onboard each 
agent for the purpose of communication. For each player, this con- 
straint is modeled by the following set of inequalities: 

0<Pt{t),PHt)<Pr^a.. (7) 

At every instant, each agent has to decide on the fraction of the 
power that needs to be allocated for communication and jamming. 
Table [T] provides a list of decision variables for the players that 
models this allocation. Each decision variable is a non-negative 
real number and lies in the interval [0, 1]. The decision variables 
belonging to each row add up to one. The fraction of the total 
power allocated by the player in row i to the player in column j 
is given by the first entry in the cell (i, j). This allocated power 
is used for jamming if the player in column j belongs to the other 
team; otherwise, it is used to communicate with the agent in the 
same team. Similarly, the distance between the agent in row i and 
the agent in column j is given by the second entry in cell (i, j). 
Since distance is a symmetric quantity, d'^ = d^' and d^j — dj^. 

Figure[T]summarizes the power allocation between the members of 
the same team as well as between the members of different teams. 

In the above game, each agent has to compute the following vari- 
ables at every instant: 

1. The instantaneous control, Ui(f). 

2. The instantaneous power level, Piit). 

3. All the decision variables present in the row corresponding 
to the agent in Table [T] 



Table 1: Decision variables and distances among agents. 
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Figure 1: Power allocation among the agents for communica- 
tion as well as jamming. 

In the next section, we analyze the problem of computing the opti- 
mal controls for each agent. 

3. OPTIMAL CONTROL PROBLEM 

From the problem formulation presented in the previous section, 
we can conclude that the objective functions of the two teams are 
in conflict. The tuple (ui*, U2*, uj*, U2*) is said to be optimal (or, 
in pair-wise saddle-point equilibrium) for the players if it satisfies 
the following conditions: 

7r[X0,Ui ,U2 ,Ui,U2j <7r[X0,Ui ,U2 ,Ui ,U2 ] (8) 
7r[X0,Ui ,U2 ,Ui ,U2 I < 7r|X0,Ui,U2,Ui ,U2 J (9) 

In simple terms, the above equations imply that agents in Team 
A are solving a joint optimization problem of minimizing the out- 
come. Similarly, agents in Team B are solving a joint optimization 
problem of maximizing the outcome. Moreover, the two teams are 
playing a zero-sum game against one another. In this case, the value 



of the game, denoted by the function J : X — >■ R, can be defined 
as follows: 

J(x) = 7r[xo,ur,U2*,u?*,U2*] (10) 

The value of the game is unique at a point X in the state-space. 
An important property satisfied by the value of the game is the 
Nash equilibrium property. The tuple (u"*, U2*, uj*, U2*) is said 
to be in Nash equilibrium if no unilateral deviation in strategy by 
a player can lead to a better outcome for that player. Hence, there 
is no motivation for the players to deviate from their equilibrium 
strategies. In terms of the outcome of the game, the strategies 
(ui * , U2* , Ui* , U2* ) are in Nash equilibrium (for the 4-player game) 
if they satisfy the following property: 



7r[xo,Ui' 



U2 ,Ui 

h 



A] 
.6*1 



7r[Xo,Ui ,U2 ,Ui,U2 



7r[Xo,Ui ,U2 



b* 



U2 



< 7r|X0,Ui ,U2 ,Ui ,U2 J 



7r[xo,u?,ur,u5*,ur] 

r a* a b* b*i 

7r[X0,Ui ,U2,Ui ,U2 J 



(11) 



In general, there may be multiple sets of strategies for the players 
that are in Nash equilibrium. Assuming the existence of a value, 
as defined in dlOb . and the existence of a unique Nash equilibrium, 
we can conclude that the Nash equilibrium concept of person-by- 
person optimality given in (TTJ is a necessary condition to be satis- 
fied for the value of the game. Further, obtaining the set of strate- 
gies that are in Nash equilibrium yields the optimal strategies for 
the players. In the following analysis, we assume the aforemen- 
tioned conditions in order to compute the optimal strategies. 

The Hamiltonian of the system is given by the following expres- 
sion: 

H = L + V J ■ /(x) 

= P?W+P2W-p?(i)-P2(i) + VJ-/(x) (12) 

In order to compute the optimal control of the players, we will use 
the Isaacs ' conditions U7< which are the following: 



1. 



H[xo,U-i ,U2 ,Ui ,U2] I „r a* a, b. b* 

UW .,"* .."* .,6 ..fc*! r S-H[Xo,Ui ,U2 ,Ui ,U2 
ii Xo,Ui ,U2 ,Ui,U2 



i7[X0,Ui ,U2 ,Ui ,U2 ] < 



//[xo,u?,ur,u?*,ur] 
//[xo,ur,uS,u?*,ur] 



Additionally, the gradient of the value function satisfies the retro- 
gressive path equations (RPE) given by the following partial dif- 
ferential equation: 

aVJ _ dH_ 
dr dx ' 

where r is the retrograde time or time left for termination. 
The RPE leads to the following equations for the players. 

Team A 



J:r» = VJ 



+ 



(4)-(.+2) J 



dya L (rfl2)2 



TeamB 

J^b = V J ■ ^^-r a 



(rf.)-{.+2) 



(df)~("+2) 

E 



(di2) 



+ 



(dfe)-(<^ + 2) J 

Here, (!) denotes derivative with respect to retrograde time. Since 
termination is only a function of the power of each player, J is in- 
dependent of the position of the players on the terminal manifold. 
Therefore, V J = at termination. This forms the boundary con- 
dition for the RPE. 

In the next section, we address the problem of power allocation. 

4. POWER ALLOCATION 



2. ji-[xo,ur,ur,u?-,ur] = 

The agents in Team A want to minimize the Hamiltonian at every 
instant, and the agents in Team B want to maximize it. The dy- 
namics of the agents are decoupled. Therefore, the Hamiltonian is 
separable in its controls and, hence, the order of taking the extrema 
becomes inconsequential. As a consequence, the optimal controls 
of the players are given by the following expression from Isaacs' 
first condition. 



a* a* b* b* 
Ui ,U2 ,Ui ,U2 



arg max min H 



The optimal control Ui is obtained by the following expression: 



ur =min„a |i-/f(x?,<,f) 



max„.^./i'(x?,u^t) 



1,2 



From the SINR received by each agent in terms of the power 
levels of the other agents as well as their mutual distances is given 
by the following expressions 

P2''W7'^(rf^^)"° 



S2 = 



b 

Si = 



pnth'^id 



12,'Tl2\-a 



^ + Pt{t)5Udl)-" + p^mic[iy 



P|(i)52i(di2)- 



^ + (t)7j K)- + P2" W7?(d?)-" 



4 = 



PHt)Si2{di 



^ + P?{thUdl)-^ + P2" (i)7l(di)-'^ 



(13) 



From the expression in ( 112b . we can conclude that the power allo- 
cation among the agents only affects the term L in the Hamiltonian. 



Therefore, Isaacs' first condition leads to the following power allo- 
cation problem among the agents. 



The next corollary then follows regarding the time horizon of the 
differential game. 



4.1 Team A 

The objective of each agent is to minimize L. 
1. Player 1: 




subject to: 

PfW < Pmax 
1, 1, 12 , 1 1 12 ^ri^ ^a3 

71+72+7 =1, 7i,72,7 >0=>7eA 
2. Player 2: 



min L 



PS 



il-vX-pl) (15) 



subject to: 



P2{t) < Pmax 
2 I 2 I 21 -1 11 21 ^ ri > ^ A 3 

71 +72 +7 = 1. 71.72, 7 > => 7 G A 

4.2 Team B 

The objective of each agent is to maximize L. 



1. Player 1: 



max max (Pi+P2^P2) (16) 



Corollary 1 . The entire game terminates in a fixed time T = 
irrespective of tlie initial position of the agents. 

Proof. From Theorem [T] and (|6}, we conclude that the T = 
for all players. All the players consume their entire energy at 
the same time, i.e., T. Therefore, the game ends at time T. □ 

Now we consider the problem of computing the optimal value of 
the decision variables for the players. In order to do so, we use 
preexisting results from continuous kernel games that are presented 
here in Theorem 2 and Theorem 3 and are stated without proof. 

Theorem 2. f2^ An N-person nonzero-sum game in which the 
finite-dimensional action spaces [/* (i G N) are compact and cost 
functionals J' (i € N) are continuous on X ■ ■ ■ X admits a 
mixed strategy Nash equilibrium (MSNE). 

From the above theorem, we can conclude that the power alloca- 
tion game has a Nash equilibrium in mixed strategies since the de- 
cision variables of each player lie on a simplex which is compact. 
Moreover, L is a continuous function of the decision variables of 
all the players. Therefore, the game admits a MSNE. Although, the 
MSNE has been computed for some games by exploiting some spe- 
cial characteristics in the cost functions, there are no standard tech- 
niques to compute MSNE for general continuous-kernel games 1221 
l2l. Therefore, we search for the conditions under which the power 
allocation game admits a pure strategy Nash Equilibrium (PSNE). 



subject to: 



2. Player 2: 



max i => max {Pi + P2 ^ Pi) (17) 
subject to: 

Pbit) < P„,ax 
Sl + $2 + S21 = 1, (52,(52,521>0 

Since the players do not communicate, they possess information 
only about their own decision variables. This makes the power 
allocation problem a continuous kernel non-zero sum game among 
the players. 

Theorem 1. The optimal value of the power consumption for 
each player is Pmax. 

Proof. Consider Player 1". The rate is a decreasing func- 
tion of Pa{t). Also, pi and p2 are increasing functions of Pa{t). 
Therefore, (p2 ^ Pi ^ P2) is a decreasing function of (t) . Hence 
the optimal value of Pa{t) — Pmax- Using the same argument for 
the other players leads to the conclusion that the optimum level of 
power consumption of every player is Pmax- □ 



Theorem 3. ^ Let U be a closed, bounded and convex sub- 
set o/R'", and for each i G N the cost functional ,P : U — >■ R fee 
continuous on U and convex in u' for every u-' ^ U-' , j (z N, j ^ i. 
Then, the associated N-person nonzero-sum game admits a PSNE. 

The above theorem provides the conditions under which we can 
guarantee existence of a PSNE. Let us consider the case of agent 
1". The expressions for SINR provided in il3\ relevant to the op- 
timization problem being solved by 1" can be written in a concise 
form as shown below: 



a 12 b bl f, dl 

S2=ai"i , Si = - — ^ — -, S2 



where 

ai = 



ci + 7( ei + 72^ 

1 



+ 



bl = 521 

a 2(1^^"" 



di2 

dl I ' 

o- , 2 fdl 

ei = r7Tw7+ 72^71 



Pmax p{d: 



Note that ai,bi, ci, di and ei are independent of the decision of 

r. 



Theorem 4. The power allocation team game has a unique 
Nash equilibrium in pure strategies if the following conditions hold 
for Team A: 



To this end, we obtain: 



g"{sl) > 0, 

5"(^?) + |(c,+7i)s'(s?)<0, 
2 

9"{sl) + ^('2' + 72)5'(S2) < 0, 
and equivalent conditions hold for Team B: 

9"{s\) > 0, 
2 

9"{4) + -r{mi + ^l)9'{si) < 0, 
g"{s2) + -{o. + Sl)g'is^,)<0, 



(18) 
(19) 

(20) 



(21) 
(22) 

(23) 



V/ii(7) 



aiS'(s2) 

fcl3'(«?) 
(ci+7j)2 

''19'(4) 
(ci+7i)2 



,V/l2(7) 



,Vft(7) = 



,v;i3(7) 



Since 7 G A'^, at most tiiree of the constraints can be active at any 
given point. Hence, the gradient of the constraints at any feasible 
point are always linearly independent. 

If two of the three constraints among {/ii , /i2 , ^3 } are active, then 7 
has a unique solution that is given by the vertex of the simplex that 
satisfies the two constraints. If only one of the constraints among 
{hi,h2, /13} is active, then we have the following cases depending 
on the active constraint 



where i £ {1, 2}. 



The constants 62, C2, d2, e^, k, rrii, Ui, and Oi are obtained by re- 
writing the SINR expressions as done above; their expressions can 
be found in the Appendix. 

Proof. Let us consider the case of I"'. From Theorem 2, we 
can conclude that a pure strategy Nash equilibrium exists if is 
convex in its arguments when the decision variables of the other 
players are fixed. From |8 |, LI is convex if and only if V^LJ > 
(For Team B, is concave if and only if V^L^ < 0), where the 
Hessian V^Lf is given in l|24j. This constitutes that 5"(s§) > 
0,9"{s\) + i^{c^Wl)9'{s\) < 0,and3"(s^) + ^(ei+72^)<;'(s^) • 
0. The theorem then follows by following similar steps to verify 
Li < 0, and V^La < 0. □ 



I. /ii(7^) = 0: 7^ = (0,7i*, 1 — 7i*) satisfies the following 
equations 



9 {S2)- 



(25) 



2. ^2(7^) = 0: 7^ = (1 — 72*, 0,72*) satisfies the following 
equations 

di9'{s2) 



(«2) = 



(ei+72^*)^ 



(26) 



3. /i3(7^) = 0: 7^ = (1 - 7i*, 7^*, 0) satisfies the following 
equations 

big'isl) 



(27) 



Applying the KKT conditions 1191 , in addition to the assumptions 
provided in the theorem that guarantee strict convexity of LI, gives 
us the following equations that need to be satisfied by the globally 
unique optimal solution (7): 



VLJ (7) + ^»Vfe.(7) + r?V/i(7) = 

1=1 

A. ft, (7) = 
A„r?> 



J e {1,2,3} 



where 



^2(7) 
^3(7) 
ft(7) 



-7'' < 

-ll < 

-72 < 
12 I 1 
7 +7i + 



1 

72 



1 = 



Now, we present the necessary and sufficient conditions for the so- 
lution to the optimization problem for the agents. Let us consider 
the case of 1". The assumptions in Theorem 4 regarding strict con- 
vexity of LJ render the KKT conditions to be necessary as well as 
sufficient conditions for the unique global minimum. 



If none of the inequality constraints are active, then 

-4 /, 1* 1* 1* 1*N 

7 =(l__7i_72 ,7i .72 ), 

is the solution to the following equations: 

bi 



ai9 (S2) 
0.19 (S2) 



[ci + 7i' 



:jzt29 (S2) = 



(28) 



[ei + 72*] 

Here, 7 lies in the set {(1, 0, 0), (0, 1, 0), (0, 0, 1), 7\ 7^ 7^ 7''}. 

An important point to note is that a\,b\,c\,d\ and e\ depend on 
the decision of the other players. Therefore, the computation of 
the decision variables depend on the value of the decision variables 
of the rest of the players. A possible way to deal with this prob- 
lem is to use iterative schemes for computation of strategies. |2] 
provides some insights into the efficacy of such schemes from the 
point of view of convergence and stability. In this work, we assume 
that each agent has enough computational power so as to complete 
these iterations in a negligible amount of time compared to the total 
horizon of the game. 








-[9"{sl) + ^{c^+^fl)g'{s\)] 



(24) 



-(^7^[ff"(«2) + |-(ei+72^)ff'(s|)] 



In the next section, we express the conditions for the existence of 
PSNE in terms of limitations imposed by the physical communica- 
tions layer. 

5. EXISTENCE OF PSNE UNDER M-QAM 
MODULATION SCHEMES 

The bit error rate (BER) depends on the SINR, the modulation 
scheme, and the error control coding scheme utilized. Communica- 
tions literature contains closed-form expressions and tight bounds 
that can be used to calculate g{s) when the noise and interference 
are assumed to be Gaussian 1 12|. For example, using uncoded M- 
QAM, where Gray encoding is used to map the bits into the sym- 
bols of the constellation, the BER can be approximated by 1231 



log(M) 



(29) 



where = 4(1 - 1/VM), /? = 3/(M - 1), and Q(.) is the 
tail probability of the standard Gaussian distribution which can be 
expressed in terms of the error function: 



Qix) 
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The conditions of Theorem |4] depend primarily on the employed 
modulation and coding schemes. 

Theorem 5. When all players employ uncoded M-QAM mod- 
ulation schemes, the power allocation team game has a unique 
PSNE solution if the following condition is satisfied: 



PpPmax (min{d^^,di2}) ° < 3cr. 



(30) 



Proof. The conditions of Theorem |4] need to be satisfied for a 
unique pure strategies solution to exist. We first verify those condi- 
tions for Player 1" when uncoded M-QAM modulations are used. 
To this end, we differentiate l |29t in to obtain: 



9"{s) = 



2 log (M) A 

C^/?(l + /3^ 
41og(M)\/2^ 



: exp 



: exp 



/3 



(31) 



From on . we conclude that condition l llSb holds for any value of 
fli and 7^^^. Condition l |19t holds given that 



/3l 

cr > -b. 



- 7i, 



or equivalently 



which we can re-write as 



> ■g'52i(di2) 



(32) 



" >f(di2)- 



(33) 



PmaxP 3 

By following similar steps, we can show that l |33l l is sufficient for 
i20t to hold. In fact, condition i33t is also sufficient for the con- 
vexity of Lg. For Team B, a sufficient condition for the concavity 



of L\ and L| is 



Pmaxp 3 



, Pr-ll2\-a 

> irid ) , 



(34) 



which can be derived following similar steps to the above. The 
theorem follows from i33i and l |34| (. □ 



Note that the left hand side of inequality OOt depends entirely 
on physical design parameters; this is of particular importance for 
design purposes. Moreover, sufficient conditions for Theorem [5] 
can be expressed in terms of the received SNRs for all players, 
which could be more insightful from a communication systems 
prospective. Consider, for example. Player 1°, and let SNR^ = 



and SNR^ 
J32| | can then be written as 



Expression 



SNR21 < |(SNRf + 1). 
Similarly, condition jlO) holds if 

SNR12 < |(SNRi + l). 

Yet another useful way to interpret condition l l3Qt is regarding it as 
a minimum rate condition: 



i? > log 1 + 



pPm,ax (min{d^^,di2})" 



where _R = log(M). 

The specific conditions for Player 1" corresponding to Il25t-i27b 
when M-QAM modulations are utilized are: 
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Also, ( 1281 ) in this case becomes 
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(a) Trajectories of the agents 
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Figure 2: 
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Figure 3: Trajectories of the agents on the plane 
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(c) Values of the decision variables 
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6. SIMULATION RESULTS 

In this section, we present some simulation results. First, we present 
simulations for arbitrarily chosen values of maximum power Pmax, 
frequencies /i, /2, modulation scheme size M, and speeds of the 
players u satisfying the convexity conditions in the previous sec- 
tion. We then change the value of one parameter, while fixing the 
rest, and present simulation results for the variants of the original 
problem. 

Figure [2] shows the trajectories of the agents, distances between 
agents, and the values of the decision variables over 200 time steps. 
The kinematics of all the agents are given by the following equa- 
tions 

Xi = Ui cos 9i, yi ~ Ui sin 0i 
We used the following values for the parameters in the simulations: 



• Pmax = 100 

• /i = 300 MHz and f-2 



100 MHz 



• Typical values of M are 2,4,16, 64, and 256. For simulation 
purposes, we fix the value of M at 2. 

• u1^ul=u'i^ul = l 



Figures [3] m and |5] contain four subfigures each repeating one of 
the above simulations for a variation of the original parameters as 
follows: 



1. Different Speeds: «f = 5, = 1, u\ = 3, and = 4 

2. Different Modulation: M = 16 

3. Different terminal conditions. 

4. Frequency exchange: /i — lOOMHz and /2 = 300MHz 

In each case, we present the simulation results when all the param- 
eters are fixed except for the one listed above. 

7. CONCLUSION AND FUTURE WORK 

This paper has studied the power allocation problem for jamming 
teams. The motion of the teams was modelled using the framework 
of pursuit-evasion games and the optimal strategies were derived. 
An underlying static game was used to obtain the optimal power 
allocation, where the power budget of each user is split between 
communication and jamming powers. This work focused on the 
analysis of teams consisting of two players only. Potential future 
directions include: 



• Computation of Singular Surfaces: In this work, we have 
computed the trajectories based on the necessary conditions 
of optimality imposed by the Isaacs' conditions. In order 
to complete the construction of the optimal trajectories of 
the agents, we have to identify the singular surfaces in the 
state space [2]. This is an interesting future research direc- 
tion since the construction and nature of the singular surfaces 
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Figure 4: Variations of the decision variables as a function of time. 




would depend on the value of the decision variables obtained 
from the power allocation game. 

• Computation ofMSNE: As discussed in Section 3, the power 
allocation games admits a MSNE without any constraints on 
the underlying communication model. An important future 
problem is to compute the MSNE for the power allocation 
game. 

• Scheduling Schemes: An interesting direction would be ex- 
ploring scheduling algorithms, similar to the one proposed in 
nil , in which players take turns in communicating or jam- 
ming. For example, the users of a given team that are closest 
in distance to the the other team could allocate all their re- 
sources to jamming, while the other users allocate all their 
resources to communicating with each other. 

• Power Control: When multiple users are present, and due to 
the broadcast nature of wireless systems, networks become 
interference-limited. The transmission power of one user can 
impede the links between other nodes due to the interference; 
hence, it is important to regulate the transmission power of 
the users in order to, for example, maximize the total capac- 
ity of the network. 

• Routing: Multihop routing improves the total throughput and 
power efficiency of a network through relaying packets via 
intermediate nodes to their final destination. Because a por- 
tion of the energy of each node has to be allocated to jam the 
other team, determining the optimal route for transmission 
becomes a challenge, especially in the presence of mobility. 
An investigation of routing protocols in the context of games 
is therefore essential for studying the overall performance of 
the networks 1251 . 

• Eavesdropping: When fa ~ ft, another security issue arises 
as the ADMs of a given team can receive and decode mes- 
sages intended for internal communications of other teams. 
To ensure secure communications, each team would need to 
allocate power to jam the eavesdroppers. In fact, a more 
general scenario is when adversarial teams consist of active 
eavesdroppers: malicious nodes that can act as jammers and 
eavesdroppers 1211 . 
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APPENDIX 



The following are the expressions of the quantities appearing in 
^S^-^ and in V^L^, V^L?, and V'^L^: 
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